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Several Remarks on Dissimilarities and 
Ultrametrics 


Dan A. Stmovict! 


Abstract 


We investigate the relationships between tolerance relations, equiv- 
alence relations, and ultrametrics. The set of spheres associated to 
an ultrametric space has a tree structure that reflects a hierarchy on 
the set of equivalences associated to that space. We show that ev- 
ery ultrametric defined on a finite space is a linear combination of 
binary ultrametric and we introduce the notion of ultrametricity for 
dissimilarities, which has applications in many data mining problems. 
Keywords: tolerance, equivalence, ultrametric, ultrametricity, tree of 
spheres 


1 Introduction 


Dissimilarity spaces constitute the natural framework for a number of ex- 
ploratory techniques in machine learning and data mining such as certain 
classification methods and clustering algorithms. We examine relationships 
that exist between various types of dissimilarities and focus on ultrametrics. 

Ultrametrics are dissimilarities that satisfy a stronger version of the 
triangular inequality (usually associated with metrics) and they occur in 
many data mining applications such as agglomerative hierarchical clustering 
algorithms [4, 5, 2, 6], and have applications in the study of phylogenetic 
trees in biology [10, 7], p-adic numbers in mathematics [12, 1], and certain 
physical systems [11], etc. 
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We evaluate the extent of the difference between dissimilarities and 
ultrametrics defined on the same set by introducing the notion of ultra- 
metricity of a dissimilarity. We show that dissimilarities can be modified to 
increase or decrease their level of ultrametricity. An increase in ultrametric- 
ity has an equalizing effect on dissimilarities and can be useful in certain 
clustering algorithms; a decrease in ultrametricity is interesting for other 
data mining applications such as the k-nearest neighbor technique and in 
outlier detection, as we have shown in [14]. 

The set of real numbers is denoted by R; the set of non-negative reals 
is denoted by Ryo. Every other set considered in below is finite. 

A quasi-dissimilarity on a set S is a function d: S' x S —>R such that 
d(x, y) > 0, d(x, x) = 0, and d(z, y) = d(y, x) for every x,y € X. We assume 
that all dissimilarity spaces considered here are finite. 

A quasi-dissimilarity d is a dissimilarity if d(x, y) = 0 implies x = y. 

A quasi-dissimilarity d is a quasi-metric if it satisfies the triangular 
inequality: 

d(x,y) < d(x, z) + d(z,y). (1) 


In addition, if d(a,y) = 0 implies x = y, then d is a metric. Inequality (1) is 
known as the triangular inequality. 

A quasi-ultrametric is a quasi-dissimilarity d : S x S —> Rso that 
satisfies the inequality 


d(x, y) < max{d(a, z),d(z,y)} (2) 


for every x,y,z € S. If, in addition, d(x, y) = 0 implies x = y, then d is an 
ultrametric. 

In Section 2 we discuss the link between ultrametrics and equivalence 
relations and we include some preliminary results. In Section 3 we examine 
properties of the collection of spheres of an ultrametric space. The relation- 
ship between multivalued and binary ultrametrics is the object of Section 4. 
In Section 5 we introduce the notion of ultrametricity of dissimilarities. 
Finally, we present conclusions in Section 6. 


2 Ultrametrics and Equivalences 
A tolerance on the set S is a relation 6 C S x § that is reflexive and 


symmetric. In other words, (x, x) € @ for every x € S and (z,y) € 6 if and 
only if (y,z) € 0 for x,y € S. The set of tolerances on S is denoted by 
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TOL(S). A tolerance on S$ that is transitive (that is, (x,y), (y,z) € 6 imply 
(x,z) € @) is an equivalence. The set of equivalences on S' is denoted by 
EQ(S). 

Let d be a quasi-dissimilarity on the set S. It is immediate that the 
relation 


bar = {(2,y) | d(x, y) < r} 


is a tolerance on S for every r € R. 

If d is an ultrametric on S, the relation 6g, is an equivalence on S' for 
any r € Ryo. Indeed, if (x,y), (y, z) € 6a, then d(x, y) <r and d(y,z) <r. 
Therefore, 

d(x, z) < max{d(z,y),d(y,z)} <r 


because of the ultrametric inequality. Thus, (2, z) € 94,, which proves that 
6a, is transitive, so it is an equivalence. 
The equivalence relations wg and wg on a set S are defined by: 


ag={(#,r) | ve Sbandwg=SxS. 


Let d: S x S —+ Ryo be a dissimilarity on S whose range is {0, L}. 
We designate such functions as binary dissimilarities. Note that d(x,y) < 
max{d(x, z),d(z,y)} for x,y,z € S. Indeed, if d(x, y) = 0, the ultrametric 
inequality is clearly satisfied. If d(x, y) = 1, then x ¥ y (since, otherwise 
we would have d(z,y) = 0). Thus, any z € S must be distinct either 
from x or from y, so at least one of the numbers d(2, z), d(z, y) is non-zero, 
and the ultrametric inequality is satisfied. We conclude that every binary 
dissimilarity is an ultrametric. 

Let 6 be an equivalence on S and let dg be defined as the characteristic 


function of @ as 
0 if (z,y) € 8, 
dg(x,y) = 
a(z,y) ‘ otherwise, 


for x,y € S. Then, dg is a quasi-ultrametric. Indeed, since (x,x) € 6 it 
follows that dg(x,x) = 0. If d(x, y) = 1, then (2, y) ¢ 0, so for every z € S 
we have (x, z) ¢ @ or (z,y) € 9. Thus, we have max{d(z, z),d(z,y)} =1, so 
the ultrametric inequality is satisfied by a, y, z. 

In particular, for da, we have 


0 ifv=y, 


dog (x,y) = . egelsy 
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for z,yEeS. 
If d is an ultrametric, the binary ultrametric dg, is given by 


bis ee 0 abet Ss 73 
L,y)= 
Hasnhad 1 otherwise 


for z,yES. 

Dissimilarities are closely related to certain families of tolerances. 

Let R be a subset of Ryo and let 6 : R? —> R be an associative 
operation on R. A {-directed family of tolerances on S is a collection 
Tg = {0, € TOL(S) | r © R}, where 6,,0, € T imply 6,65 © Agr,s) € T- Tp 
is a bounded family if there exists r € R such that 6, = wg. 


Example 2.1. Let d : S x S —+ Rso be a metric on the finite set S. 
Suppose that the range of d is Ran(d) = {ri,...,7m} and let 04, be the 
tolerance 


bar = {(z;y) | d(x, y) < ye 


If ry < rg < +++ < Tm the collection {@4, | r € Ran(d)} is a directed 
6-family of tolerances, where (a,b) = a+ b. Indeed, if (x,y) € 0474s, there 
exists t € S such that (x,t) € Oar, (t,y) € Aas, that is, d(a,t) < r and 
d(t, y) < s, which implies d(z,y) < r+ s. Thus, (x,y) € Oar+s- 

Conversely, if {0, | r € R} is a 6-family of tolerances, where (3 is 
defined as above, then d: S x S —+ Ryo defined by d(x, y) = min{r € R | 
d(x,y) € R} is a metric on S. 

Indeed, if d(x, z) = a and d(z,y) = b, then (x, z) € 0a, (z,y) € 4%, so 
(x,y) € 049% © Oa4b, which implies d(x, y) < a+6 = d(a,z)+d(z,y) 

Similarly, if 6 is replaced by G(a,b) = max{a,b}, then any (-directed 
family of tolerances defines an ultrametric, and every ultrametric can be 
obtained in this manner. 


3 Spheres in Ultrametric Spaces 


Let (S,d) be a dissimilarity space. The closed sphere centered in a and 
having radius r is the set 


Ba(xo,r) = {x € X | d(xo, xz) <r}. 


A triangle in a dissimilarity space (5, d) is a triple (x, y, z) € S°. To simplify 
the notation, we denote t = (x,y,z) by xyz. 
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The following properties of an ultrametric space (.S,d) are well-known 
(see [13]): 


(i) for every triangle t = xyz € S°, two of the numbers d(z, y), d(x, z), d(z, y) 
are equal and the third is not larger than the largest two numbers; 
thus, every triangle in an ultrametric space is isosceles; 


(ii) if B(xo, 4) m B(yo, a) # 0, then B(xo, 4) = B(yo, @); 


(iii) two spheres B(x, a) and B(yo, 6) are either disjoint or one of them is 
included in the other; 


(iv) every s € B(xo, a) is a center of the closed sphere B(9, a); 


(v) if « ¢ B(ao,r), then the distance from x to any point of the sphere 
B(xo,1) is the same. 


It is interesting to note that for an ultrametric the equivalence classes 
of 04, coincide with the spheres of the form B(x,r) because (a, y) € Aq, is 
equivalent to y € B(x,r). Thus, the ultrametric space (5, d) is partitioned 
by the set of spheres of radius r. This yields the quotient space S/@q,, whose 
elements are the spheres of radius r of (5, d). 


Theorem 3.1. Let d be an ultrametric (a quasi-ultrametric) on a finite set 
S. If |S| =n, then d takes at most n — 1 positive values. 


Proof. The proof is by induction on n > 2. The base case, n = 2 is immediate. 
Let n > 3 and suppose that the statement holds for m < n. 

Suppose that S = {21,22,...,2%,}. Without loss of generality we may 
assume that d(z1,72) = min{d(z,y) | 2,y € Sand d(z,y) > O}. Then, 
d(x, 21) = d(xz, £2) for all k such that 3<k <n. 

Let {a1,...,a,} = {d(az, 21) | 3< k < n}, where 0 < ap < ay <--- < 
ar. Define 

Be ={e,| b> sand day, 01) = az} 


forl <j <r. Thecollection B,,...,B, isa partition of the set {73,74,...,@n}. 
Let m; = |B;|; then my +--+ +m, =n—- 2. 

Ifu € B; and v € B; with i < j, then d(u,z1) = a < aj = d(v, 21), 
hence d(u,v) = aj. Therefore, the values of d(u,v) for u,v in distinct 
blocks of the partition belong to the set {a1,...,a-}. By the induction 
hypothesis, for each k, 1 < k <r, the restriction of d to Bg can take 
at most m; — 1 distinct positive values; therefore, d can take at most 
l+r+(m,—1)+---+(m,-—1)=n-—1 distinct positive values on S. 
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Table 1: Ultrametric defined by dendrogram 

XZ, | 2 | %3 |] Lg | V5 | VE | V7 | WB 
a} 0}2/2)3)3 )3) 3 +4 83 
v2} 2)/0/1;)3)3)3) 3 4 3 
vz} 2}1/0/ 3/3/34) 34) 8 
r4 | 3 3 | 3 0 1 1 2 2 
v5. | 280) Be Saf Le poO | bi ey 2 
tv} 3} 3/3] 1/21/04 24 2 
r7 | 3 3 | 3 2 2 2 0 1 
rg | 3 3 | 3 2 2 2 1 0 


Let (S,d) be a finite ultrametric space. Since the range of values of d, 
Ran(d) is a finite set that contains at most |S|— 1 values, the set SPH(S, d) 
of spheres of this space SPH(S,d) = {B(a,r) | « € S andr € Ran(d)} is 
finite. Consider the graph 75 q = (SPH(S), V) having SPH(S, d) as its set of 
vertices, where an edge (B(z,r), B(y, s)) exists if B(z,r) C Bly, s). 

Ts,a is a rooted tree. Indeed, if dj, = max{d(z,y) | x,y € S}, then the 
root of the tree is the sphere B(x,d,,), where x is an arbitrary element of 9. 
Since each sphere B(z,r) is included in B(x,d,,), it follows that the graph 
Ts.a is connected. Furthermore, 754 is acyclic. Indeed, if we would have a 
cycle 

B(x1,11) Cc B(x2,12) Gee 
CB Git tai) C Blair) = Ble) 


an immediate contradiction would follow because of the strict inclusions that 
exist between the spheres of this chain. The height of the tree of sphere 75,4 
of an ultrametric space (S,d) cannot exceed the number of distinct values of 


d. 


Example 3.2. For the ultrametric space defined by Table 1 the tree of 
spheres is shown in Figure 1. 


4 Binary and Multivalued Ultrametrics 


Let d,e be two dissimilarities on S. The dissimilarity e dominates d if 
d(xz,y) < e(x,y) for every x,y € S. We denote this by dC e. 


Theorem 4.1. The dissimilarity e dominates the dissimilarity d if and only 
if Oe,r © Oar for every r € Rso. 
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B(a;,,3) forl<i<¢8 


B(xa, 2) 
B(a5, 2) 
B(x6, 2) 
B(ag, 2) 


B(x1, 2) 
B(x2, 2) 
B(x3, 2) 


B(x4,1) 
B(x5,1) 
B(xg, 1) 


B(x7,1) 
B(xg,1) 


B(x2,1) 
B(a3,1) 


B(x1,0) 


B(a1.1) B(x2,0) B(x3,0) B(x4,0) B(#5,0) B(a6g,0) B(x7,0) B(xg,0) 


Figure 1: Tree of spheres 


Proof. Indeed, suppose that e dominates d. Let (x,y) € Oc, so e(@,y) <r. 
Since d(x, y) < e(x,y) <7, it follows that (x,y) € 04,7, 80 Der EC Oar- 


Conversely, suppose that 0. C 64, for every r € Ryo. Since (x,y) € 


Pe.e(x,y) F Fa,e(a,y) it follows that d(x, y) < e(x,y) for every x,y € S, that is, 
that e dominates d. 


Lemma 4.2. If B(xo,a), B(yo, a) are two disjoint spheres in an ultrametric 
space (S,d), then for x € B(xo,a) and y € B(yo,a) we have d(z,y) = 
d(xo, Yo): 


Proof. Note that d(ao, yo) > a because the spheres B(xo, a) and B(yo, a) are 
disjoint. Since d(x, 20) < a and the triangle xxoyo is isosceles, we must have 
d(x, yo) = d(xo, yo). On the other hand, the triangle xyoy is also isosceles 
and d(x, y) < a it follows that d(x, y) = d(x, yo), so d(x, y) = d(x0, yo). 


Theorem 4.3. Let (5, d) be an ultrametric space, where Ran(d) = {0,11,..-,%m}, 
O0<ry<-++<1Trm, andm<|S|—1. The quotient space S/64,, contains no 
more than n — k spheres. 


Proof. The positive distances between elements of S' located inside a sphere 
B(a,rz) range in the set {r,,...,r,}. If there are q, spheres of the form 
B(a,rx) there exist at most g, — 1 distinct values of d between the centers 
of these spheres. By Lemma 4.2 there are no more than k + qz — 1 values of 
the distance d between the points of S and, therefore, k + q,—-1<n-—1, 
which implies gq, <n —k. 
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We saw that starting from the characteristic function of an equivalence 
relation we can build an ultrametric whose range is the set {0,1}. The next 
statement gives a method of constructing ultrametrics starting from chains 
of equivalence relations. 


Theorem 4.4. Let S be a finite set and let d: S x S —+ Rso be a function 
whose range is Ran(d) = {r1,...,%m}, where r, = 0 such that d(x, y) = 0 if 
and only if x = y. 

The function d is an ultrametric on S if and only if the sequence of 
relations Oa7,,+++,9d,rm 18 an increasing chain of equivalences on S such 
that Oar, = ag and O47, = Ws. 


Proof. Suppose that d is an ultrametric on S. We have (a,x) € 0a, because 
d(x,a) = 0, so all relations 64,, are reflexive. Also, it is clear that the 
symmetry of d implies (x,y) € 0a,, if and only if (y,x) € @a7,, so these 
relations are symmetric. 

The ultrametric inequality is essential for proving the transitivity of 
the relations 04,,. If (x,y), (y,z) € 9a,r,;, then d(x, y) <r; and d(y,z) < ri, 
which implies d(x, z) < max{d(a, y),d(y, z)} <1r;. Thus, (a, z) € @4,7,, which 
shows that every relation 6q,, is transitive and therefore an equivalence. 

It is straightforward to see that 0a7, < @aryo < +++ < Oar,,; that is, this 
sequence of relations is indeed a chain of equivalences. 

Conversely, suppose that 0q7,,..-,9d,r, iS an increasing sequence of 
equivalences on S such that 0¢,, = ag and @¢,,, = ws, where 0¢,, = 
{(z,y)E SxS | d(z,y) <1r;} for 1 <i< mand r; =0. 

Note that d(x, y) = 0 is equivalent to (x,y) € Oar, = ag, that is, to 
L=y. 

We claim that 


d(z,y) = min{r | (x,y) € Oar}. (3) 


Indeed, since 0q,r,, = wg, it is clear that there is an equivalence 64, such 
that (a, y) € Oa,r,. If (x,y) € @a7,, the definition of 64,., implies d(a,y) < rj, 
so d(x,y) < min{r | (x,y) € 0a,}. This inequality can be easily seen to 
become an equality since (x,y) € Aa.a(z,y)- This implies immediately that d 
is symmetric. 

To prove that d satisfies the ultrametric inequality, let x,y, z be three 
members of the set S$. Let p = max{d(z, z),d(z,y)}. Since (x, z) € Oaaie,z) © 
Gap and (z,y) © Oaa(zy) S Pap, it follows that (x,y) € Aap, due to the 
transitivity of 0q,. Thus, d(z,y) < p= max{d(z, z),d(z,y)}, which proves 
the triangular inequality for d. 
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Theorem 4.5. Let S be a finite set and letd: S x S —+ Rso be a a quasi- 
ultrametric whose range is Ran(d) = {r1,...,%m}, where r; =0. Then d is 
a linear combination with positive coefficients of binary quasi-ultrametrics, 


d(x, y) = aid (x,y) + agdo(x,y) ++++ + @m—idm—1(2, y) 
forxz,yeS. 


Proof. Note that the range of d consists of m — 1 positive values, som < |5S| 
by Theorem 3.1. Assume that 0 = 11 < ro < +--+ < 1m. 

Consider the equivalences 6,,, for 1 <7 < m and the corresponding 
binary quasi-ultrametrics d; given by 


1 otherwise 


; cen 
ae, = {5 if d(z,y) <r. 


for (x,y) € S x S. Note that d,,(x, y) = 0 for every x,y € S. 
We claim that there exist m— 1 numbers ay,...,@m—1 such that 


d(x, y) = aid (x,y) oF azd2(x, y) ches 
+Am—1dm—1(2, y) 


for z,yEeS. 
Indeed, note that d(x, y) € {r1,...,1%m} by the definition of the range 
of d. Suppose that we have 


Tet ip < dey) He Sr Se << is 


This implies 


Ce, 0 ifk<igm 
(x,y) = 
Oe 1 otherwise, 


so Th = ay +--+ + agz_1 for 1 < k < m. This implies ay = ro,ag = 
73 —12,-+-,;A@m—1 =m — Tm—1 and we obtain the necessary equality. 


5 Ultrametricity of Dissimilarities 


Let Fy : R39 —+ Rso be the function defined by F,(a, b) = (a? + bP) p, where 
p>d0. 
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An p-dissimilarity is a dissimilarity d : S? —> S such that 
d(x,y) < F,(d(z, z), d(z,y)) (4) 


for x,y,z € S. We denote by D,(S) the collection of F,-dissimilarities 
on S. The ultrametricity of a dissimilarity space (S,d) is the number 
u(S,d) = sup{p € Ryo | d € Dy}. 

We note that every dissimilarity on a set S belongs to Do(S). Also, every 
dissimilarity in D1(S) satisfies the triangular axiom d(x, y) < d(, z)+d(z, y) 
for x,y, Z, so it is a metric on S. 


Lemma 5.1. Let p,q be two positive numbers. If p > q then we have 
F,(a,b) < Fa(a, b). 


Proof. Consider the function @ : Rs9 —> Rso given by 


$(p) = (a? +0P)?. 


We have 
b'(p) _ a?Ina+b?Inb In(a? + BP) 
(p) p(aP + bP) p 
Since oe In ap a In ar < 0 it follows that @' (p) < 0, which 


shows that ¢ is a decreasing function and this implies the statement of the 
lemma. 


Note also that for any p > 0 the function F;, is monotonic in each of its 
arguments. 
Lemma 5.1 implies that if p > q, then D,(S') C Dg(S). 


Theorem 5.2. Let d be a dissimilarity on a set S. We have d € ps0 D,. if 
and only if d is an ultrametric on S. 


Proof. If d € (\p59 Pr we have d(z,y) < (d(x,z)" + d(z,y)")r for every 
r >Oand z,y,z € S. Therefore, d(z,y) < lim,_,o(d(a, z)" + d(z,y)")* = 
max{d(x, z),d(z,y}, which implies that d is an ultrametric. 

Conversely, if dis an ultrametric, we have d(x, y) < max{d(z, z),d(z,y)} < 
(d(x, z)"+d(z, y)")r for every r > 0, sod € D, for every r > 0. Since d € Do, 
it follows that d € (),.s9 Dr- 


Theorem 5.3. Let r,s be two positive numbers such that s <r, letd € Dg. 
The family of r-dissimilarities dominated by d has a largest element. 
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Proof. Let D = {d; | 1 € I} be a collection of dissimilarities on a set S such 
that D C D,. Then, the dissimilarity d defined by d(x, y) = maxje; d;(z, y) 
for x,y € S belongs to D,. 

It is immediate that d itself is a dissimilarity on S and we have 


d(x,y) < di(x,y) < F,(di(z, z), di(z,y)) 
(for every i € I because d; € D,) 
< F,(d(x,z),d(z,y)) 
(because F;. is monotonic in both 


its arguments), 


so dé D,. 

Let e be a dissimilarity in D, and let D be the set of dissimilarities in 
D, dominated by e. If d'(z, y) = max{d € D,d < e}, then d’ <eandd<d' 
for every d € D, so d’ is the largest dissimilarity in D. 


Theorem 5.3 implies that given a dissimilarity d on a set S there exists 
a largest metric (in D1) that is dominated by d. Another, well-known result 
that follows from this theorem is the fact that given a dissimilarity, there 
exists the largest ultrametric that is dominated by this dissimilarity [8, 13]. 
This ultrametric is obtained by clustering the metric space using the single- 
link hierarchical clustering. 

Let r,s be two positive numbers. An (r, s)-transformation is a function 
g : Rso —> Rso such that 


(i) g(x) = 0 if and only if z = 0; 
(ii) g is a strictly monotonic function on Ryo, and 
(iii) g(a, b)) < Fs(g(a), g(6)) for a, 6 € Rso. 


Theorem 5.4. [fd is a dissimilarity on S such that d € D, and g is an 
(r, s)-transformation, then gd is a dissimilarity in Ds. 


Proof. Let d be an r-dissimilarity. It is immediate that gd is a dissimilarity. 
Since g is an (r, s)-transformation, we have 


Fs(g(d(a, z)), 9(d(z, y))) g( Fr (d(a, z), d(z, y)) 


2 
> g(d(x,y)), 


so gd is an s-dissimilarity. 
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Example 5.5. The function g given by g(x) = In(a# +1) for z >Oisa 
(2, 1)-transformation. Indeed, for a,b > 0 we have the immediate inequality: 


g(Fa(a,b)) = In(Va? +6? +1) 
< In(fa+1)+m(b+1). 


Note that if g is an (r, s)-transformation, then g~! is an (s,r)-transformation. 


Therefore, the function h(x) = e” — 1 is a (1, 2)-transformation. 


Example 5.6. Let a > 0 and let g: R>0 —> Rso be the monotonic function 
defined by g(a) = «° for x > 0. If s < =, then g is an (r, s)-transformation. 
Indeed, we have 


F,(g(a), 9(b)) at 45") 
Fas(a,b))° 

(a, b))° 
by Lemma 5.1) 
= 9(F,(a,b)), 


which shows that g is an (r,s)-transformation. For this transformation 


WV 


( 
( 
(F; 
( 


we have u(S,gd) = urd) Thus, by choosing @ we can modulate the 
ultrametricity of the transformed dissimilarity space. U 


The notion of ultrametricity introduced above involves satisfying In- 
equality (4) for all triangles xyz of the dissimilarity space. Therefore, a few 
triangles in the dissimilarity space which have very different side lengths can 
unduly influence the value of the ultrametricity. This motivates considering 
yet another variant of ultrametricity of dissimilarities. 

Let t = zyz € S® be a triangle in the is a dissimilarity space (5, d). 
Following Lerman’s notation in [9], if d(z,y) > d(x, z) > d(y, z), we write 
Sa(t) = d(z,y) for the longest side of t, Ma(t) = d(a, z) for the middle side 
of t, and La(t) = d(y, z) for the shortest side of the triangle. We consider 
two local variants of ultrametricity of a triangle. 

The strong ultrametricity of t is the number uq(t) = max{r > 0 | 
Sa(t) < F,(Ma(t), La(t))}. The weak ultrametricity of t is the number wa(t) 
given by 

<a if Sa(t) > Ma(t) 
wa(t) = ¢ 182 maw 
oe) if Sg(t) = Ma(t). 
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If wa(t) = 00, then t is an ultrametric triple. 
The weak ultrametricity of the dissimilarity space (S,d) is the number 
w(S,d) defined by 


w(S,d) = median{wa(t) | t € S°}. 
For a triangle t we have 
1 1 
0 < Sy(t) — M,(t) = (2% = 1) M,(t) < (22m = 1) M,(t) 


Thus, if wa(t) is sufficiently large, the triangle t is almost isosceles. For 
example, if wg(t) = 5, the difference between the length of longest side Sq(t) 
and the median side M;(t) is less than 15%. 

For every triangle t € S® in a dissimilarity space we have u(t) < 
wa(t). Indeed, since Sq(t)"4 < Mg(t)"4 + La(t)"@ we have Sq(t)“4 < 
2M,(t)“4, which implies immediately ug(t) < wa(t). 


Theorem 5.7. Let (S,d) be a dissimilarity space and let f : Rso —> Rso 
be a strictly increasing function on Rso. 
If the function g : Rso —>+ Rso given by 


£0) ifa>0, 
g(a) = 
0 ifa=0 


is strictly decreasing, then the function e: Sx S —+ Rso defined by e(x, y) = 
f(d(x,y)) for x,y € S is a dissimilarity and wa(t) < we(t) for every triangle 
te s°. 


Proof. It is immediate that e(x, y) = e(y, x) and e(x, x) = 0 for x,y € S. Let 
t=ayz € S® bea triangle. Since S4(t) > M(t) and g is strictly decreasing, 
g(Sa(t)) < g(Ma(t)), which implies foe < ae Since f is a strictly 
increasing function we have S.(t) = f(Sa(t)) and M.(t) = f(Ma(t)). This 


allows us to write: 
sommes galt) _ Galt). Salt 
Me(t) — f(Ma(t)) ~ Ma(t)’ 


Therefore, 
1 1 
“| ee 
082 Mi) 082 M(t) 
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Example 5.8. Let (5, d) be a dissimilarity space and let e be the dissimilarity 
defined by e(z, y) = d(x, y)", where 0 <r <1. If f(a) =a’, then f is strictly 
increasing and the function g : Ryo —> Rso given by 


F@) ifa > 0, on tas 0, 
g(a) = ; = 
0 ifa=0 0 ifa=0 


is strictly decreasing. Therefore, the weak ultrametricity w(t) is greater 
than wg(t), where e(x, y) = (d(,y))” for z,y € S. 


Example 5.9. Let f : Rs9 —+ Ryo be defined by f(a) = fy. It is easy to 
see that f is strictly increasing on Ryo and 


te test 
a) = l+a 1 , 
g(a) 3 ifa=0 


is strictly decreasing on the same set. Therefore, the weak ultrametricity of 
a triangle increases when d is replaced by e given by 


_ d(x,y) 
e(xz,y) = idea) 


for z,yE S. 


Example 5.10. For a dissimilarity space (.S,d), the Schoenberg transform 
of d described in [3] is the dissimilarity e : $7 —+ Rso defined by 


e(a,y) = 1— eho) 

for z,y € S. Let f : Rso —> Rs be the function f(a) = 1 — e~** that is 
used in this transformation. It is immediate that f is a strictly increasing 
function. For a > 0 we have g(a) = +“—, which allows us to write 


e**(ka+1)—1 


g(a) = - 


for a > 0. Taking into account the obvious inequality ka + 1 < e*® for 
k > 0, it follows that the function g is strictly decreasing. Thus, the weak 
ultrametricity of a triangle relative to the Schoenberg transform is greater 
than the weak ultrametricity under the original dissimilarity. 
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6 Conclusions and Future Work 


Theorem 5.3 highlights the common nature of the results used in two rather 
areas of data mining: obtaining the subdominant ultrametric for a dis- 
similarity through the single-link hierarchical clustering, and obtaining a 
configuration of points in a metric space whose distances approximate object 
dissimilarities. The later process is known as non-metric multidimensional 
scaling. We propose to explore computing the largest metric that approxi- 
mates a dissimilarity between objects without the intermediate calculation 
of a representation of the objects in R” equipped with a Minkowski metric. 
We introduced the notion of ultrametricity of dissimilarities. Trans- 
formations that increase or diminish ultrametricity, decrease or accentuate 
discrepancies between dissimilarity values, respectively. We will examine 
the impact of these transformations on various clustering algorithms and 
classification methods (such as the k'® nearest: neighbor). Another possible 
application of these transformations lies in the area of outlier detection. 
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